Maximum Likelihood Estimation in Latent Class Models For Contingency Table Data

نویسندگان

  • Stephen E. Fienberg
  • Patricia Hersh
  • Alessandro Rinaldo
  • Yi Zhou
چکیده

Statistical models with latent structure have a history going back to the 1950s and have seen widespread use in the social sciences and, more recently, in computational biology and in machine learning. Here we study the basic latent class model proposed originally by the sociologist Paul F. Lazarfeld for categorical variables, and we explain its geometric structure. We draw parallels between the statistical and geometric properties of latent class models and we illustrate geometrically the causes of many problems associated with maximum likelihood estimation and related statistical inference. In particular, we focus on issues of non-identifiability and determination of the model dimension, of maximization of the likelihood function and on the effect of symmetric data. We illustrate these phenomena with a variety of synthetic and real-life tables, of different dimension and complexity. Much of the motivation for this work stems from the “100 Swiss Francs” problem, which we introduce and describe in detail.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multivariate Ordered Logit Regressions

In this paper we combine recent advances in marginal modelling for contingency tables with the notion of copula to formulate a class of models for describing how the joint distribution of a set of ordinal response variables depends on exogenous regressors. We derive the main properties of a marginal parameterization, the global interaction copula, whose nature is essentially non parametric, and...

متن کامل

Reduced rank models for contingency tables

In recent years much attention has been given to models for two-way contingency tables that can be formulated in terms of reduced rank of a matrix with probabilities. A well-known reduced rank model is the independence model, where the rank is one. For rank higher than one distinct classes of reduced rank models are possible. Each has the independence model as the special case for rank one. A f...

متن کامل

A comparison of algorithms for maximum likelihood estimation of Spatial GLM models

In spatial generalized linear mixed models, spatial correlation is assumed by adding normal latent variables to the model. In these models because of the non-Gaussian spatial response and the presence of latent variables the likelihood function cannot usually be given in a closed form, thus the maximum likelihood approach is very challenging. The main purpose of this paper is to introduce two n...

متن کامل

poLCA: An R Package for Polytomous Variable Latent Class Analysis

poLCA is a software package for the estimation of latent class and latent class regression models for polytomous outcome variables, implemented in the R statistical computing environment. Both models can be called using a single simple command line. The basic latent class model is a finite mixture model in which the component distributions are assumed to be multi-way cross-classification tables...

متن کامل

poLCA: Polytomous Variable Latent Class Analysis Version 1.2

poLCA is a software package for the estimation of latent class and latent class regression models for polytomous outcome variables, implemented in the R statistical computing environment. Both models can be called using a single simple command line. The basic latent class model is a finite mixture model in which the component distributions are assumed to be multi-way cross-classification tables...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007